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Abstract 

We introduce a Three Tier Tree Calculus (T 3 C) that defines in a systematic way three tiers of tree 
structures underlying proof search in logic programming. We use T 3 C to define a new - structural - 
version of resolution for logic programming. 
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1 Introduction 

As ICLP is celebrating the 200 r/ ' anniversary of George Boole, we are reflecting on the 
fundamental “laws” underlying derivations in logic programming (LP), and making an 
attempt to formulate some fundamental principles for first-order proof search, analogous 
in generality to Boole’s “laws of thought” for propositional logic dBoole 1854b . 

Any such principles must be able to reflect two important features of first-order proof 
search in LP: its recursive and non-deterministic nature. For this they must satisfy two 
criteria: to be able to (a) model infinite structures and (b) reflect the non-determinism of 
proof search, relating “laws of infinity” with “laws of non-determinism” in LP. 

Example 1.1 The program P\ inductively defines the set of natural numbers: 

0. nat(O) 

1. nat(s(X)) <— nat(X) 

To answer the question “Does Pi h nat(s(X)) hold?”, we first represent it as the LP query 
? nat(s(X)) and then use SLD-resolution to resolve this query with P\. The topmost 
clause selection strategy first resolves nat(s(X)) with P\ ’s second clause (Clause 1), and 
then resolves the resulting term with P\’s first clause (Clause 0). This gives the derivation 
nat(s(X)) —> nat(X) —» true, which computes the solution {X i —> 0} in its last step. So 
one answer to our question is “Yes, provided X is 0.” 
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Even for this simple inductive program, there will be clause selection strategies (or clause 
orderings) that will result in infinite SLD-derivations. If Clause 1 is repeatedly resolved 
against, the infinite computation will compute the first limit ordinal. 


The least and greatest Herbrand model semantics ( Ivan Emden and Kowalski 19761 Lloyd 1988 


Ivan Emden and Abdallah 1985i captured very well the recursive (and corecursive!) nature 
of LP (thus satisfying our criterion (a)). For example, the least Herbrand model for P\ is 
an infinite set of finite terms nat(O), nat(s(0)), nat(s(s(0))),.... The greatest complete 
Herbrand model for program P\ is the set containing all of the finite terms in the least Her¬ 
brand model for Pi together with the first limit ordinal nat(s(s(...))). However, due to its 
declarative nature, the semantics does not reflect the operational non-deterministic nature 
of LP, and thus fails our criterion (b). 


The operational semantics of LP has seen the introduction of a variety of tree struc¬ 
tures reflecting the non-deterministic nature of proof search: proof trees, SLD-derivation 
trees , and and-or-trees, just to name a few. However, these do not adequately capture 
the infinite structures arising in LP proof search. It is well-known that SLD-derivations 
for any program P are sound and complete with respect to the least Herbrand model for 
P (Lloyd 1988), but this soundness and completeness depends crucially on termination of 
SLD-derivations, and termination is not always available in LP proof search. As a result, 
logical entailment is only semi-decidable in LP. 

In one attempt to match the greatest complete Herbrand semantics for potentially non¬ 
terminating programs, an operational counterpart — called computations at infinity — was 
introduced in ( Lloyd 1988[lvan Emden and Abdallah 19851 1. The operational semantics of 
a potentially nonterminating logic program P was then taken to be the set of all infinite 
ground terms computable by P at infinity. Computations at infinity better capture the com¬ 
putational behaviour of non-terminating logic programs, but infinite computations do not 
result in implementations. This observation suggests one more criterion: (c) our operational 
semantics must be able to provide an observational (constructive) approach to potential in¬ 
finity and non-determinism of LP proof search, thus incorporating “laws of observability”. 

Coinductive logic programming (CoLP) (Gupta et al. 2007 Si mon et al. 2 007) provides 
a method for terminating certain infinite SLD-derivations (thus satisfying our criteria (a) 
and (c)). This is based on the principle of coinduction, which is in turn based on the ability 
to finitely observe coinductive hypotheses and succeed when coinductive conclusions are 
reached. CoLP’s search for coinductive hypotheses and conclusions uses a fairly straight¬ 
forward loop detection mechanism. It requires the programmer to supply annotations clas¬ 
sifying every predicate as either inductive or coinductive. Then, for queries marked as 
coinductive, it observes finite fragments of SLD-derivations, checks them for unifying sub¬ 
goals, and terminates when loops determined by such subgoals are found. 

The loop detection mechanism of CoLP has three major limitations, all arising from the 
fact that it has relatively week support for analysis of various proof-search strategies and 
term structures arising in LP proof search (and thus for our criterion (b)). 


(1) It does not work well for cases of mixed induction-coinduction. For example, to 
coinductively define an infinite stream of Fibonacci numbers, we would need to include in¬ 
ductive clauses defining addition on natural numbers. Coinductive goals will be mixed with 
inductive subgoals. Closing such computations by simple loop detection is problematic. 
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(2) There are programs for which computations at infinity produces an infinite term, 
whereas CoLP fails to find unifiable loops. 

Consider the following (coinductive) program Pi that has the single clause 


0. f rom(X,scons(X,Y)) from(s(X),Y) 

Given the query ? from(0,X), and writing as an abbreviation for the stream con¬ 
structor scons, we have that the infinite term t' = from(0, [0, [s(0), [s(s(0)),...]]]) is 
computable at infinity by Pi and is also contained in the greatest Herbrand model for 
Pi. However, Pi b f rom(0, X) cannot be proven using the unification-based loop detection 
technique of CoLP. Since the terms from(0, scons(0,X')), from(s(0), scons(s(0),X")), 
from(s(s(0)), scons(s(s(s(0))),X ,,, ) ) ... arising in the derivation for P 2 and ? <—f rom(0,X) 
will never unify, CoLP will never terminate. 

(3) CoLP fails to reflect the fact that some infinite computations are not productive, 
i.e., do not produce an infinite term at infinity. The notion of productivity of corecursion 
is well studied in the semantics of other programming languages (lEndrullis et al. 20T0t 
Agda 20151 Coq 2015). For example, no matter how long an SLD-derivation for the fol¬ 


lowing program P 3 runs, it does not produce an infinite term, and the resulting computation 
is thus coinductively meaningless: 

0. bad(X) bad(X) 


Somewhat misleadingly, CoLP’s loop detection terminates with success for such programs, 
thus failing to guarantee coinductive construction of infinite terms (failing criterion (a)). 

Is our quest for a theory of LP satisfying criteria (a), (b), and (c) hopeless? We take a 
step back and recollect that the semantics of first-order logic and recursive schemes offers 
one classical approach to formulating structural properties of potentially infinite first-order 
terms. Best summarised in “Fundamental Properties of Infinite Trees” (ICourcelle 1983k 
the approach comes down to formulating some structural laws underlying first-order syn¬ 
tax. It starts with definition of a tree language as a (possibly infinite) set of sequences of 
natural numbers satisfying conditions of prefix-closedness and finite branching. Given a 
first-order signature E together with a countable set of variables Var, a first-order term tree 
is defined as a map from a tree language L to the set E U Var. Size of the domain of the 
map determines the size of the term tree. The “laws” are then given by imposing several 
structural properties: (i) in a given term tree, arities imposed by E must be reflected by 
the branching in the underlying tree language; (ii) variables have arity 0 and thus can only 
occur at leaves of the trees; and (iii) the operation of substitution is given by replacing 
leaf variables with term trees. A calculus for the operation can be formulated in terms of a 
suitable unification algorithm. We give formal definitions in Sections[2]and[3] 

We extend this elegant theory of infinite trees to give an operational semantics of LP 
that satisfies criteria (a), (b), and (c). We borrow a few general principles from this theory. 
Structural properties of trees (given by arity and variable constraints) and operations on 
trees (substitutions) are defined by means of “structural laws” that hold for finite and infi¬ 
nite trees. This gives us constructive approach to infinity (cf. criteria (a) and (c)). It remains 
to find the right kind of structures to reflect the non-determinism of proof search in LP. 

Given a logic program P and a term (tree) t, the first question we may ask is whether 
t matches any of P’s clauses. First-order term matching is a restricted form of unification 
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? 4 — nat(s(X)) ? 4 — nat(s(0)) 

nat(s(X)) nat(s(0)) 

X\ nat(s(X)) <— nat(X) Xi nat(s(0)) <— nat(O) 


nat(X) nat(O) 

/ \ / \ 

X 2 X% nat(O) 4 — X 3 


Fig. 1. The rewriting trees for P\ and ? <— nat(s(X)) and ? «— nat(s(0)). The trees form a transition relative 
to the Tier 2 variable X 2 (shown by —i). The second tree is a successful proof for ? <— nat(s(X))). 


employed in (first-order) term rewriting systems (TRS) dTerese 20031 ) and — via pattern¬ 
matching — in functional programming. For our P and t , we may proceed with term match¬ 
ing steps recursively, mimicking an SLD-derivation in which unification is restricted to 
term matching. Consider the matching sequences for four different terms and the coinduc- 
tive program Pj from above: 

from(0,X) from(0, [0,X']) from(0, [0, [s(0),X"]]) from(0, [0, [s(0). [s(s(0)).X m ]]]) 

from(s(0),X') from(s(0), [s(0),X"]) from(s(0), [s(0), [s(s(0)),X m ]]) 

f rom(s(s(0)),X") f rom(s(s(0)), [s(s(0)),X"']) 

f rom(s(s(s(0))),X"') 

Let us call term matching sequences as above rewriting trees , to highlight their relation 
to TRS. The above sequences can already reveal some of the structural properties of the 
given logic program. If £2 is the signature of the program P 2 , and if we denote all finite 
term trees that can be formed from this signature as Term(Ei), then a rewriting tree for Pi 
can be defined as a map from a given tree language L to Term ( £ 2 ). Since rewriting trees 
are built upon term trees, we may say that term trees give a first tier of tree structures, while 
the rewriting trees give a second tier of tree structures. To formulate suitable laws for the 
second tier, we need to refine our notion of rewriting trees. 

Given a program P and a term t, we may additionally reflect how many clauses from P 
can be unified with t , and how many terms those clauses contain in their bodies. We thus 
introduce a new kind of “or-nodes” to track the matching clauses. If P has n clauses, t may 
potentially have up to n alternative matching sequences. When a clause i does not match 
a given term tree f, we may use a Tier 2 variable to denote the fact that, although t does 
not match clause i at the moment, a match may be found for some instantiation of t. Thus, 
for the program P\ above and the queries ? •<— nat(s(X)) and ? nat(s(0)), we will have 

the two rewriting trees of Figure[j] We note the alternating or-nodes (given by clauses) and 
and-nodes (given by terms from clause bodies) and Tier 2 variables. 

Two kinds of laws are imposed on structure of rewriting trees: 

- arity constraints: the arity of an and-node is the number of clauses in the program, 
and arity of and or-node is the number of terms in its clause body. 

- variable constraints: variable leaves have arity 0, and run over the objects being de¬ 
fined (rewriting trees). Variables are the leaves in which substitution can take place. 

In Figure [T] Tier 2 variable Xi is substituted by a one-node rewriting tree nat(0) . 
Such substitutions constitute the fundamental operation on Tier 2 trees, and give rise to a 
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calculus for Tier 2 given in terms of so-called rewriting tree transitions. Figure Q] shows a 
transition from a rewriting tree for ? nat(s(X)) to a rewriting tree for ? nat(s(0)) 

which corresponds to the SLD-derivation outlined in Example 11.11 Thus, a derivation is 
a sequence of tree transitions (given by the Tier 2 operation of substitution). We call this 
method structural resolution, or S-resolution for short. Its formal relation to TRS and type 
theory is given in (Fu and Komendantskaya 2015). Section[4]will introduce Tier 2 formally. 

We note the remarkably precise analogy between structures and operations of Tier 1 
and Tier 2. Rewriting trees can be finite or infinite. For programs Pi and Pi, any rewriting 
tree will be finite, but program P 3 will give rise to infinite rewriting trees. Once again, our 
structural analysis is fully generic for finite and infinite tree structures at Tier 2, which fits 
our criterion (a). Rewriting trees perfectly reflect the “non-determinism laws” (criterion 
(b)), thanks to and-nodes and or- nodes keeping a structural account of all the search op¬ 
tions. Finally, our structural analysis perfectly fits criterion (c). For productive programs 
like Pi and Pi, the length of a derivation may be infinite, however, each rewriting tree will 
necessarily be finite. This ensures observational approach to corecursion and productivity. 

We complete the picture by introducing the third tier of trees reflecting different search 
strategies arising from substitution into different variables of Tier 2. Given the set Rew(P) 
of all finite rewriting trees defined for program P, a derivation tree is given by a map from 
a tree language L to Rew(P). The arity of a given node in a derivation tree (itself given by 
a rewriting tree) is the number of Tier 2 variables in that rewriting tree. The construction 
of derivation trees is similar to the construction of SLD-derivation trees (as it accounts for 
all possible derivation strategies). The trees of Tier 3 are formally defined in Section [5] 

The resulting Three Tier Tree Calculus (P 3 C) developed in this paper formalises the 
fundamental properties of trees arising in LP proof search. Apart from being theoretically 
pleasing, this new theory can actually deliver very practical results. The finiteness of rewrit¬ 
ing trees comprising a possibly infinite derivation gives an important observational prop¬ 
erty for defining and semi-deciding (observational) productivity for corecursion in LP. This 
puts LP on par with other languages in terms of observational productivity and coinductive 
semantics (Endrullis et a l. 20101 Agda 2015] Coq 2015] >. With a notion of productivity in 
hand for LP, we can ask for results showing inductive and coinductive soundness of deriva¬ 
tions given by transitions among rewriting trees. The two pictures above give, respectively, 
a sound coinductive observation of a proof for t' = from( 0 , [ 0 , [s( 0 ), [s(s( 0 )),...]]]) with 
respect to P 2 , and a sound inductive derivation for nat(s(X)) with respect to P\. Our ongo¬ 
ing and future research based on T 3 C will be further explained in Section[ 6 ] 


2 Background: Tree Languages 

Our notation for trees is a variant of that in, e.g., ( jLloyd 1988) ICourcelle 1983 ). Let N* 
denote the set of all finite words (i.e., sequences) over the set N of natural numbers. The 
length of a word w £ N* is denoted by |w|. The empty word e has length 0. We identify 
the natural number i and the word i of length 1. If w is a word of length /, then for each 
i £ {1,...,/}, wi is the i th element of w. We may write w = w\...wi to indicate that w is a 
word of length /. We use letters from the end of the alphabet, such u,v, and w, to denote 
words in N* of any length, and letters from the middle of the alphabet, such as i,j, and k, 
to denote words in N* of length 1 (i.e., individual natural numbers). The concatenation of 
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e 


0 

/ \ 

00 01 


e 

/ \ 

0 1 
/ \ 

10 


stream 


scons 
/ \ 

0 Y 


scons 

/ \ 

0 scons 

/ \ 

0 


Fig. 2. The two figures on the left depict the finite and infinite tree languages {£,0,00,01} and 
{£,0,1,10,11,...}. The two figures on the right depict the finite term tree stream(scons(X,Y)) and the infi¬ 
nite term tree scons(0, scons(0,...)), both over Ej. 


words w and u is denoted wu. The word v is a prefix of w if there exists a word u such that 
w = vu, and a proper prefix of w if u 7 ^ E. 

Definition 2.1 A set LC N* is a (finitely branching) tree language if the following condi¬ 
tions are satisfied: 

For all w £ N* and all i, j £ N, if w j £ L then w £ L and, for all i < j, wi £ L. 

For all w £ L, the set of all i £ N such that wi £ L is finite. 

A tree language L is finite if it is a finite subset of N*, and infinite otherwise. Examples 
of finite and infinite tree languages are given in Figured We may call a word w £ L a node 
of L. If w = w\W 2 --.wi, then a node w\W 2 ---Wk for k < l is an ancestor of w. The node w is 
the parent of wi, and nodes wi for i £ N are children of w. A branch of a tree language L is 
a subset If of L such that, for all w,v £ If, w is an ancestor of v or v is an ancestor of w. If 
L is a tree language and w is a node of L, the subtree ofL at w is L\w = {v | vvv £ L}. 

We can now define our three-tier calculus 7 'C. 


3 Tier 1: Term Trees 

In this section, we introduce Tier 1 of T'C, highlighting the structural properties of its ob¬ 
jects (arity, branching, variables), the operation of first-order substitution, and the relevant 
calculus given by unification. 

3.1 Tier 1 structural properties: Signature as codomain, arity, and variables 

The trees of T^C’s first tier are term trees over a (first-order) signature. A signature E 
is a non-empty set of function symbols, each with an associated arity. The arity of f £ 
E is denoted arity(f). For example, Ei = {stream, scons, 0}, with arity{ scons) = 2, 
arity( stream) = 1, and arity(0) = 0, is a signature. To define term trees over E, we also 
need a countably infinite set Var of variables disjoint from E, each with arity 0. We use 
capital letters from the end of the alphabet, such as X, Y, and Z, to denote variables in Var. 

Definition 3.1 Let L be a non-empty tree language and let E be a signature. A term tree 
over E is a function t'.L—y E U Var such that, for all w £ L, arity(t(w)) = {; | wi £ Lj |. 

Structural properties of tree languages extend to term trees. For example, a term tree 
t : L —v EU Var has depth depth(t) = max{|w| | w £ L}. The subtree of t at node w is given 
by t ' : (L\w) —» E U V, where t'(v) = t (vvv) for each v £ L\w. 

Term trees are finite or infinite according as their domains are finite or infinite. Term 
trees over E may be infinite even if E is finite. Figure [2] shows the finite and infinite term 
trees stream(scons(X,Y)) and scons(0,scons(0,...)) over Ei. The set of finite (infi¬ 
nite) term trees over a signature E is denoted Term(E) (Term“(E)). The set of all (i.e.. 
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finite and infinite) term trees over E is denoted by Term® (E). Term trees with no oc¬ 
currences of variables are ground. We write GTerm(E) (GTerm“(E), GTerm ffl (E)) for 
the set of finite (infinite, all ) ground term trees over E. GTerm(E) is also known as the 
Herbrand base for E, and GTerm®(E) is known as the complete Herbrand base for E, in 
the literature ( Lloyd 1988[ ). Both GTerm(E) and GTerm <H (E) are used to define the Her¬ 
brand model and complete Herbrand model (declarative) semantics of LP (IKowalski 19741 
Lloyd 1988). Additionally, GTerm“(E) is used to give an operational semantics to SLD- 


computations at infinity in ([Lloyd 1988} Ivan Emden and Abdallah 19851). 


3.2 Tier 1 operation: First-order substitution 

A substitution of term trees over E is a total function <7 : Var —» Term(E). We write id 
for the identity substitution. If <7 has finite support — i.e., if |{X £ Var | <7(X) X}| £ N 

— and if <7 maps the variables X, to term trees f,-, respectively, and is the identity on all 
other variables, then we may write a as {Xi t\ ,..., X„ <->• t„}. The set of all substitutions 
over a signature E is Subst(E). Substitutions are extended from variables to term trees 
homomorphically: if t £ Term(E) and a £ Subst(E), then the application a(t) is defined 
by (a(t))(w) = t(w ) if f(w) 0 Var , and (cr(f))(w) = (ff(X))(v) if w = uv , t(u) = X, and 
X £ Var. Composition of substitutions is denoted by juxtaposition, so \ it) is <72(<7 i (f)). 
Since composition is associative, we write < 73 02 <7| rather than (03 02)01 or ). 


3.3 Tier 1 calculus: Unification 

A substitution a over E is a unifier for term trees t and u over E if ff(f) = c t(m), and a 
matcher for t against u if o{t) = u. A substitution CTi is more general than a substitution 
Ob. denoted <J\ < Oi, if there exists a substitution cr such that <7<7i(X) = <72 (X) for every 
X £ Var. A substitution <7 is a most general unifier (mgu ) for t and u if it is a unifier for t 
and u, and is more general than any (other) such unifier. A most general matcher (mgm) is 
defined analogously. Both mgms and mgus are unique up to variable renaming. 

We write t ~<j u if <7 is a mgu for t and u, and t -< a u if <7 is a mgm for t against u. 
Our notation is reasonable: unification is reflexive, symmetric, and transitive, but match¬ 
ing is reflexive and transitive only. Mgms and mgus can be computed using Robinson’s 
seminal unification algorithm (see, e.g., (Lloyd 1988. Pfenning 2007])). Any standard uni¬ 
fication algorithm (possibly represented by system of sequent-like rules ([Pfenning 2007 


Fu and Komendantskaya 2015|) can be seen as the calculus of Tier 1. Additional details 


about unification and matching can be found in, e.g., (Baader and Snyder 2001). 


4 Tier 2: Rewriting Trees 

In this section, we introduce Tier 2 of T 3 C, highlighting the structural properties of rewrit¬ 
ing trees: codomains comprising term trees and clauses, suitable notions of arity, the oper¬ 
ation of Tier 2 substitution, and the relevant calculus given by rewriting tree transitions. 

4.1 Tier 2 structural properties: Terms and clauses as codomain, arity, and variables 

In LP, a clause C over a signature E is a pair (A, [Bo, ...,B„]), where A £ Term(E) and 
[Bo, .. .B„] is a list of term trees in Term(E). Such a clause C is usually written as A 4— 
Bo, ■ • • ,B n - The head A of C is denoted head(C) and the body Bo,... ,B„ of C is denoted 
body(C). In 7 ’C, a clause over E is naturally represented as a total function (also called 
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C ) from a finite tree language L of depth 1 to Term(E) such that C(e) = head(C), and if 
body{C) is Bo,... ,B„ then, for each i £ L, C(i) = Bj. The set of all clauses over E is denoted 
by Clause(E). A goal clause G over E is a clause ? Bo, ■.. ,B„ over EU {?}. Here, ? is 
a specified symbol not occurring in EU Var, and Bq, ... ,B„ are term trees in Term(E). The 


goal clause ? -<— is called the empty goal clause over E. We consider every goal clause 


over E to be a clause over E. The arity of a clause A Bo, ... ,B„ is n + 1. The symbol 
head(C)(e ) is the predicate of C. 

A logic program over E is a total function from a set {0,1,..., n} C N to the set of non¬ 
goal clauses over E. The set of all logic programs over E is denoted LP(E). The arity of 
P £ LP(E) is the number \dom(P) \ of clauses in P. 

We extend substitutions from variables to clauses and programs homomorphic ally. The 
variables of a clause C can be renamed with “fresh” variables — i.e., with variables that do 
not appear elsewhere in the current context — to get a new a-equivalent clause that can be 
used interchangeably with C. We assume variables have been thus renamed apart when¬ 
ever convenient. Renaming apart avoids circular (non-terminating) cases of unification and 
matching in LP. Under renaming, we can always assume that a mgm or mgu of a clause 
and a term is idempotent, i.e., that oo = a. 

We now define the trees of Tier 2. Rewriting trees allow us to simultaneously track all 
matching sequences appearing in an LP derivation, and thus to see relationships between 
them. Since rewriting trees use only matching in their computation steps, they capture 
theorem proving (i.e., computations holding for all compatible term trees). By contrast, the 
Tier 3 derivation trees defined in Section @ use full unification, and thus capture problem 
solving (i.e., computations holding only for certain compatible term trees). 

We distinguish two kinds of nodes in rewriting trees: and-nodes capturing terms coming 
from clause bodies, and or-nodes capturing the idea that every term tree can in principle 
match several clause heads. We also introduce or-node variables to signify the possibility 
of unification when matching of a term tree against a program clause fails. 

Definition 4.1 Let Vr be a countably infinite set of variables disjoint from Var. If P £ 
LP(E), C £ Clause(E), and c £ Subst(E) is idempotent, then rew(P,C, d) is the function 
T : dom[T) —> Term(E) UClause(E) U Vr, where dom(T ) is a non-empty tree language, 
satisfying the following conditions: 

1. T(e) = ct(C) £ Clause(E) and, for all i € dom(C)\{e}, T(i) = a(C(i)). 

2. Forw £ dom(T) with |w| even and |w| > 0, T(w) £ Clause(E) U Vr. Moreover, 

- ifT(w ) £ Vr, then {j \ wj £ dom(T)} = 0, and 

- ifT(w) = B £ Clause (E), then there exists a clause P(i) and an mgm 0 for P(i) against 
head(B). Moreover, for every j £ dom(P(i))\{e}, wj £ dom(T) andT(wj) = o(Q(P(i)(j))). 

3. For w £ dom{T ) with |w| odd, T(w) £ Term(E). Moreover, for every i £ dom{P), we have 

- wi £ dom(T), and 



<7(9(P(i))) ifhead(P{i )) -<e T(w) and 

afreshX£VR otherwise 


4. No other words are in dom(T). 


A node T (w) of rew (P, C, a) is an or-node if |w| is even and an and-node if |w| is odd. The 
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node T(e) is the root of rew(P, C, a). If P £ LP(E), then T is a rewriting tree for P if it is 
either the empty tree or rew(P,C, o) for some C £ Clause(E) and <J € Subst(E). 

The arity of a node T(w) in T = rew(P, C, cr) is arity(P) if T(w) £ Term(E), arity(C) 
if T(w) £ Clause(E), and 0 if T(w) £ Vr. The role of the parameter a in the definition of 
rew will become clear when we discuss the notion of substitution for Tier 2. For now, we 
may think of a as the identity substitution. 


Example 4.1 The rewriting trees rew(Pi, ? 4— nat(s (X))fd) and rew(Pi, ? 4— nat(s(0 )),id) 
are shown in Figure\T\ 


A rewriting tree for a program P is finite or infinite according as its domain is finite or 
infinite. We write Rew(P) for the set of finite rewriting trees for P, Rew°°(P) for the set of 
infinite rewriting trees for P, and Rew® (P) for the set of all (finite and infinite) rewriting 
trees for P. In (Komendantskaya et al. 2014), a logic program P is called (observationally) 
productive, if each rewriting tree constructed for it is in Rew(P). Programs Pi and Pi are 
productive in this sense, whereas program P3 is not. In future work, we will introduce 
methods that semi-decide observational productivity. 


4.2 Tier 2 operation: Substitution of rewriting trees for Tier 2 variables 

With rewriting trees as the objects of Tier 2 and a suitable notion of a Tier 2 variable, we 
can replay Tier 1 substitution at Tier 2 by defining Tier 2 substitution to be the replacement 
of Tier 2 variables by rewriting trees. However, in light of the structural dependency of 
rewriting trees on term trees in Definition l4.ll we must also incorporate first-order substi¬ 
tution into Tier 2 substitution. Exactly how this is done is reflected in the next definition. 


Definition 4.2 Let P £ LP(L), C £ Clause(E), a,a' £ Subst(E) idempotent, and T = 
rew(P,C, ct). Then the rewriting tree o'(T) is defined as follows: 

for every w £ dom(T) such that T (w) is an and-node or non-variable or-node, (<j'(T))(w) = 
<y'(T(w)). 

for every wi £ dom(T) such thatT{wi) £ Vr, if 6 is an mgm ofhead(P(i)) against o'(T)(w), 
then (o' (T))(wiv) = rew(P, 0(P(f)), c 7 'cj)(v). (Note v = e is possible.) If no mgm ofhead(P(i)) 
against < 7 '(T)(w) exists, then (cr , (P))(wf) =T(wi). 


Both items in the above definition are important in order to make sure that, given a rewriting 
tree T and a first-order substitution a, <j(T) satisfies Definition 14. II 

Example 4.2 Consider the first rewriting tree T of Figure\l\ Given first-order substitution 
CT = {X 0}, the second tree of that Figure gives cr(T). Note that Tier 2 variable Xi is 
substituted by the one-node rewriting tree nat(O) £- as a result. In addition, all occur¬ 
rences of the first-order variable X in T are substituted by 0 in cr(T). 


Drawing from Examples 14.11 and 14.21 we would ideally like to formally connect the 
definition of a rewriting tree and Tier 2 substitution, and say that, given T = rewlP.C.id) 
and a first-order substitution <7, o(T) = rew(P. <j(C),id). However, this does not hold in 
general, as was also noticed in (Komendantskaya et al. 2014). Given a clause C = (f<— 
t\,... ,t n ), we say a variable X is existential if it occurs in some f ; - but not in t. The presence 
of existential variables shows why the third parameter in definition of rew is crucial: 
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? 4— conn(a, c) 
conn(a, c) 


? f- conn(a, c) 


conn a. c) 


Xi ■ —rfbnnjaTc) 4— edgeta.Z'yconnfZfic) Xi X3 X,-connfai c) edgefa,b), connfb, c) XT'" X3 

edgefa^) connfZ'.c) edge(a.b) conn(b,c) 

X4 X5 X(, X-] Xg ' X:, X\q X4 Xg edge(a/b£i^~^ X7 Xg ■ X9 conn(b.c) 4 


Fig. 3. The infinite rewriting trees T and T' for the program P 4 of Example 14.31 the clause conn(a, c), and 
the substitutions id and |z' 1 —> b}, respectively. T offers no proof that P 4 logically entails conn(a, c), but the 
underlined steps in T' comprise precisely such a proof. The figure also illustrates a transition from T to T' 
relative to variable Xg. 


Example 4.3 The graph connectivity program P 4 is given by 
0. conn(X,X) ■£- 

1. conn(X,Y) <— edge(X,Z),conn(Z,Y) 

2. edge(a,b) £- 

3. conn(b. c) £- 

Figure\3\shows rewriting trees T = revj(P 4 ,C,id) and T' = rew (P 4 ,C,d), where C =? •<— 
conn(a, c), and 0 = \ll M- b}. Note that 9(T) = T' but rew^, 0(C), id) ^ T'. This hap¬ 
pens because Clause 1 contains an existential variable Z in its body, and construction of 
rew(F4, 0(C), id) fails to apply the substitution 0 down the tree. 

Given T = rew(.P,C, a), for T' = 9(T) = rew (P,C,9a) to hold, we must make sure 
that the procedure of renaming variables apart used implicitly when computing mgms 
during the rewriting tree construction is tuned in such a way that existential variables 
contained in the domain of 0 are still in correspondence with the existential variables in 
rew(P,C, 6a). We achieve this by introducing a new renaming apart convention to supple¬ 
ment Definition l4.ll Given a program P and a clause P(i) with distinct existential variables 
Zj.... ,Z„ £ Var , we impose an additional condition on the standard renaming apart proce¬ 
dure. During the construction of T = rew( /l C, a), when an and-node T (w) is matched with 
head{P(i)) via 0 in order to form T[wi) = 6 (P{i)), P(i )’s existential variables Z\,... ,Z„ 
must be renamed apart as follows: 

- We partition Var into two disjoint sets called Vy and Vg. The set Vg is used to rename 
existential variables apart, while Vu is used to (re)name all other variables. 

- Moreover, when computing an mgm 0 for T(w) and ( P(i )), every existential vari¬ 
able Z; from Z\ ,... ,Z„ is renamed apart from variables of T using the following indexing 
convention: Z;. n- E}-, with £*■ £ Vg. 

When writing T(wi) = 6 {P{i)) we assume that the above renaming convention is already 
accounted for by 0. This ensures that the existential variables will be uniquely deter¬ 
mined and synchronized for every two nodes T(w) and T'(w) in T = rew(P.C. a) and 
T' = rew(P, C, da). Subject to this renaming convention, the following theorem holds. 

Theorem 4.1 Let P eLP(Z,), C £ Clause(E), and 0, tJ £ Subst(E). Then 0(rew(F,C, ct)) = 
rew(P,C, da). 

Proof. Let T = rew (P,C, a), and let T' = rew(P,C, 0<r). We need to prove that d(T) = T'. 
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The proof proceeds by induction on the length of the tree T and by cases on the types of 
nodes in T and 9(T). 

- If T(w) and 0(T(w)) are non-variable or-nodes (including the case He)), then, by 
Definition ^. 21 9(T)(w) = 9(T(w)) = 9a(C*), where C* is either C (i.e., it is a root node) 
or some P(i) G P. But, by Definition 14. II T'(w) = 6 <J(C*). (Here, the synchronisation of 
renamed existential variables is essential, as described.) 

- If T (w) and 9 (T (w)j are and-nodes, then the argument is similar. 

- If T ( wi) is a variable or-node, then, by Definition l4.2l two cases are possible: 

(1) If no mgm for 9(T(w)) and head(P(i )) exists, then 9(T)(w ) = 9(T(w)). But then 
no mgm for T'(w) and head(P(i )) exists either, so T'(w) = 9(T(w)). 

(2) If the mgm for 9(T(w)) and head(P(i )) exists, then by Definition 14. 21 9(T)(wi) = 
rew(P, 9'(P(i)), e<r)(e), where 9' is the mgm of 9(T(w)) and head(P(i)). The rest of the 
proof proceeds by induction on the depth of rew (71 9'lP( i) ), 9a). 

Base case. For the root 9(T)(wi) = rew(P, 9'(P(i)),9a)(e), by Definition [47] we have 
that rew(P, 9'(P(i)), 9a)(s) = ( 9a)(9'(P(i ))). On the other hand. Definition 14.11 also 
gives that T'(wi) = (9a)(9"(P(i))), where 9" is the mgm of T'(w) and head(P(i)). Since 
T'(w) = ( 9(T))(w ) by the earlier argument for and-nodes, 9' and 9" are mgus of equal 
term trees and head(P(i)), so 9' = 9". Then T'(wi) = ( 9(T))(wi ), as desired. 

Inductive case. We need only consider the situation when T(wivj) is undefined, but 
( 9(T))(wivj ) is defined. By Definition 14.21 9{T)(wivj) = rew(P 1 9 l (P(i)) 1 9a)(vj). This 
node can be either an and-node, a variable or-node, or a non-variable or-node. The first two 
cases are simple; we spell out the latter, more complex case only. 

If 9{T)(wivj) is a non-variable or-node then, by Definition l4. II it must be ( 9a){9*P{j )), 
where 9* is the mgm of 9(T)(wiv ) and head(P(j)). On the other hand. Definition ^. I l also 
gives that T'(wivj) = ( 9a)(9**(P(j ))), where 9** is the mgm of T'iwiv) and head(P{j)). 
Since T'{wiv) = ( 9{T))(wiv) by the induction hypothesis, 9* and 0** are mgms of equal 
term trees and head(P{j)), so 9* = 9**. Thus T'(wivj) = ( 9{T))(wivj ), as desired. □ 

4.3 Tier 2 calculus: Rewriting tree transitions 

The operation of Tier 2 substitution is all we need to define transitions among rewriting 
trees. Let P G LP(E) and t G Term(E). If head{P(i )) L then a is the resolvent of P(i) 
and t. If no such a exists then P(i) and t have null resolvent. A non-null resolvent is an 
internal resolvent if it is an mgm of P(i) against t, and it is an external resolvent otherwise. 
Definition 4.3 Let P G LP( E) and T = rew(P, C, a') G Rew a (P). IfX = T(wi) G Vr, then 
the rewriting tree Tx is defined as follows. If the external resolvent a for P(i) and T(w) is 
null, then Tx is the empty tree. If a is non-null, then Tx = rew (P,C,aa'). 

IfT GRew“(E) and X G Vr, then the computation of Tx from T is denoted Trans (/l T.X) = 
Tx- If the other parameters are clear we simply write T —? T x . The operation T -G T x is a 
tree transition for P and C. A tree transition for P G LP(E) is a tree transition for P and 
some C G Clause(E). A (finite or infinite) sequence T = rew(P.C.id) —>7) —> T2 —> ... of 
tree transitions for P is a derivation for P and C. Each rewriting tree T, in the derivation 
is given by rew (P, C, cr,... Oia \), where <Ti, 02 ■ ■ ■ is the sequence of external resolvents 
associated with the derivation. When we want to contrast the above derivations with SLD- 
derivations, we call them S-derivations, or derivations by structural resolution. 
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Example 4.4 Tree transitions for P\ and P 4 are shown in Figures\J]and\3\ respectively. 


It is our current work to prove that S-derivations are sound and complete relative to 
declarative semantics of LP; see also (Fu and Komendantskaya 2015) for a comparative 
study of the operational properties of S-derivations and SLD-derivations. 


5 Tier 3: Derivation Trees 

While the rewriting trees of Tier 2 capture transitions between Tier 1 term trees that depend 
on matching, the derivation trees of Tier 3 capture transitions between Tier 2 rewriting 
trees that depend on unification. Derivation trees thus allow us to simultaneously track 
all unification sequences appearing in an LP derivation. The arity of a rewriting tree T, 
denoted arity(T ), is the cardinality of the set indices(T) of indices of variables from Vr in 
T. There is always a bijection pos from indices(T) to the (possibly infinite) set arity(T). 


Definition 5.1 If P £ LP(E) and C £ Clause(E), the derivation tree derf/lC) is the func¬ 
tion D : dom(D) —> Rew“(P) such that D(e) = rew(P,C,id), and if w £ dom(D), i £ 
arity(D(yv)), and i = pos{k), then wi £ dom(D) and D(wi) is Trans(P,D(w),24). 


For P £ LP(E) and C £ Clause(E), the derivation tree derf/l C) is unique up to renam¬ 
ing. If P £ LP(E), then I) is a derivation tree for P if it is der (P.C) for some C £ Clause(E). 
A derivation tree is finite or infinite according as its domain is finite or infinite. Inductive 
programs like Pi and coinductive programs like P 2 will have infinite derivation trees, so 
construction of the full derivation trees for such programs is infeasible. Nevertheless, finite 
initial fragments of derivation trees may be used to make coinductive observations about 
various routes for proof search. We are currently exploring this research direction. 

6 Conclusions and Future Work 


This paper gives the first fully formal exposition of the Three Tier Tree Calculus T 3 C for 
S-resolution, relating “laws of infinity”, “laws of non-determinism”, and “laws of observ¬ 
ability” of proof search in LP in a uniform, conceptual way. Implementation of derivations 
by S-resolution is available ( Komendantskaya et al. 2015) . 

The structural approach to LP put forth in this paper relies on the syntactic structure of 
programs rather than on their (operational, declarative, or other) semantics. In essence, it 
presents an LP analogue of the kinds of reasoning that types and pattern matching support 
in interactive theorem proving (ITP) ( jAgda 20T5||Coq 2015| ). Further study of this analogy 
is an interesting direction for future research. 

Our next steps will be to formulate a theory of universal and observational productivity 
of (co)recursion in LP, and to supply T 3 C with semi-decidable algorithms for ensuring pro¬ 
gram productivity (akin to guardedness checks in ITP). Formally proving that S-resolution 
is both inductively and coinductively sound is another of our current goals. 

Since LP and similar automated proof search methods underlie type inference in ITP 
and other programming languages, S-resolution also has the potential to impact the design 
and implementation of typeful programming languages. This is another research direction 
we are currently pursuing. 
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